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(54) Nonlinear filter for noise suppression in linear prediction speech processing devices 



(57) The invention relates to a linear prediction 
audio signal processing apparatus, such as a vocoder, 
including a nonlinear filter to attenuate the residual sig- 
nal used to excite a linear prediction synthesis filter. The 
nonlinear filter is capable of reducing the noise compo- 
nent in the signal while keeping only the periodic com- 



ponent of the speech signal. This feature enhances 
speech quality. The invention also extends to a novel 
method for processing a residual signal used to excite a 
linear prediction synthesis filter in order to attenuate 
wide band additive noise in the speech signal as con- 
structed by the synthesis filter. 
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Description 

FIELD OF THE INVENTION 

[0001 ] This invention relates to the field of processing 
audio signals, such as speech signals that have been 
compressed or encoded with a digital signal processing 
technique. More specifically, the invention relates to a 
method and an apparatus for nonlinear filtering a resid- 
ual signal capable of exciting a linear prediction synthe- 
sis filter to construct an audio signal. 

BACKGROUND OF THE INVENTION 

[0002] When an audio signal is compressed by an 
encoder, such as by a code excited linear prediction 
(CELP) type encoder the additive noise that may be 
present in the background when the audio signal is 
recorded, will be processed with the speech signal. This 
noise component is not desirable because it contributes 
to degrade the speech quality when a decoder proc- 
esses the compressed audio signal in order to build a 
replica of the original signal. In this context, reducing the 
noise component in the signal while keeping only the 
periodic component of the speech signal would greatly 
enhance the speech quality. 

[0003] At present, one of the techniques used for 
noise reduction is called center-clipping. With this tech- 
nique, distortions may be introduced into the speech 
signal due to a disturbance in the short-term correlation 
properties, or, viewed in the frequency domain, distor- 
tions in successive short-term spectra may result. In 
contrast, the LPC residual is spectrum flattened and 
minor nonlinear operations do not introduce significant 
changes in the spectral shapes. 
[0004] Thus, there exists a need in the industry to pro- 
vide a method and an apparatus for enhancing speech 
quality by reducing noise that may be present in the 
speech signal. 

OBJECTS AND STATEMENT OF THE INVENTION 

[0005] An object of the invention is to improve an 
audio signal processing device, such as a Linear Pre- 
dictive (LP) encoder or a LP decoder, by providing a 
means in the audio signal processing device to reduce 
the perceptual effect of noise in the audio signal. 
[0006] Another object of the invention is to provide a 
method for processing a residual signal capable of 
exciting a linear prediction synthesis filter to generate a 
replica of an audio signal, so as to reduce the percep- 
tual effect of noise in the audio signal output by the syn- 
thesis filter. 

[0007] The present invention provides a non-linear fil- 
ter comprising a residual signal processing means for 
generating a residual signal capable of exciting a linear 
prediction filter to generate a replica of an audio signal, 
said means comprising: means for attenuating an 



amplitude of the residual signal according to a transfer 
function which establishes a degree of amplitude atten- 
uation that varies in accordance with an amplitude of 
the residual signal. 

s [0008] In a further aspect the invention provides an 
improvement to an audio signal processing apparatus 
including means for generating a residual signal for use 
in exciting a linear prediction filter to generate a replica 
of an audio signal, the improvement comprising a non- 

10 linear filter that includes: 

an input for receiving the residual signal; 

a residual signal processing means coupled to said 

input for receiving the residual signal, said residual 

is signal processing means having a transfer function 
that causes an attenuation of the residual signal, 
said transfer function establishing a degree of 
amplitude attenuation that varies in a non-linear 
manner with the amplitude of the residual signal; 

20 and 

an output coupled to said residual signal process- 
ing means for outputting the residual signal altered 
by said residual signal processing means. 

25 [0009] In this specification, the term "coefficient seg- 
ment" is intended to refer to any set of coefficients that 
uniquely defines a filter function which models the 
human vocal tract. It also refers to any type of informa- 
tion format from which the coefficients may indirectly be 

30 extracted. In conventional vocoders, several different 
types of coefficients are known, including reflection 
coefficients, arcsines of the reflection coefficients, line 
spectrum pairs, log area ratios, among others. These 
different types of coefficients are usually related by 

35 mathematical transformations and have different prop- 
erties that suit them to different applications. Thus, the 
term "coefficient segment" is intended to encompass 
any of these types of coefficients. 
[0010] The "excitation segment" can be defined as 

40 information that needs to be combined with the coeffi- 
cients segment in order to provide a complete represen- 
tation of the audio signal. It also refers to any type of 
information format from which the excitation may indi- 
rectly be extracted. The excitation segment comple- 

45 ments the coefficients segment when synthesizing the 
signal to obtain a signal in a non-compressed form such 
as in PCM sample representations. Such excitation seg- 
ment may include parametric information describing the 
periodicity of the speech signal, an excitation signal as 

so computed by the encoder of a vocoder, speech framing 
control information to ensure synchronous framing in 
the decoder associated with the remote vocoder, pitch 
periods, pitch lags, gains and relative gains, among oth- 
ers. 

55 [001 1 ] The coefficient segment and the excitation seg- 
ment can be represented in various ways in the signal 
transmitted through the network of the telephone com- 
pany. One possibility is to transmit the information as 
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such, in other words a sequence of bits that represents 
the values of the parameters to be communicated. 
Another possibility is to transmit a list of indices that do 
not convey by themselves the parameters of the digi- 
tized form of the speech signal, but simply constitute 
entries in a database or codebook allowing the decoder 
of the vocoder to look-up this database and extract, on 
the basis of the various indices received, the pertinent 
information to construct the digitized form of the speech 
signal. 

[001 2] In the most preferred embodiment of this inven- 
tion, the non-linear filter stage is incorporated in the 
encoder stage of a CELP vocoder. In this type of voco- 
der, the incoming speech is digitized and used to gener- 
ate a spectrum-flattened residual signal by linear 
prediction. Periodicity is removed from the residual sig- 
nal through use of pitch prediction filter (open-loop pitch 
predictor) or the incoming signal is partially matched 
with the aid of past excitation passed through a pitch 
synthesis filter (closed-loop pitch prediction). Sections 
of the signal corresponding to vowels generally show 
strong pitch periodicity and therefore high pitch predic- 
tion gain. If adaptive and stochastic codebooks are 
used to synthesize a replica of the incoming signal, for 
sustained voiced segments the relative contribution of 
the adaptive codebook is higher than that of the sto- 
chastic codebook. Near the onset of the voicing, how- 
ever, where the past excitation may not have a strong 
periodic component, the stochastic codebook serves to 
generate the initial pulse and the adaptive codebook 
contribution is relatively much smaller. The linear-pre- 
diction analysis filter removes the short-time correlation 
from each frame of signal, with no concern regarding 
the periodicity of the residual generated. Small devia- 
tions from the periodicity of the speech signal may result 
in large aperiodicities in the residual signal. Such aperi- 
odicities are considered detrimental to the resynthesis 
of the signal with good quality. 

[0013] The non-linear filter along with a LPC inverse 
filter and a LPC synthesis filter is located at the outlet of 
a LPC analysis processor to alter the residual from the 
original PCM speech signal and noise input. The trans- 
fer function of the non-linear filter is such that only sam- 
ples having amplitude less than a predetermined 
threshold will be attenuated. The degree of attenuation 
is a non-linear function of the sample amplitude. The 
higher the amplitude, the higher the attenuation will be. 
This approach has been found to be particularly effec- 
tive in suppressing noise since samples of the residual 
signal that are below the amplitude threshold are, in all 
likelihood, noise. 

[0014] In a most preferred embodiment, the amplitude 
threshold can be varied to suit the speech signal/noise 
ratio in the speech signal. A convenient way to estimate 
the amplitude threshold, above which no alteration to 
the residual signal is effected, is to calculate the stand- 
ard deviation of the amplitude of a plurality of succes- 
sive samples in the residual signal. Typically, the 



standard deviation is calculated over a full residual sig- 
nal frame and the amplitude threshold value is then lin- 
early computed from it. This calculation is effected at 
every signal frame, thus allowing the amplitude thresh- 

5 old to be dynamically updated in accordance with the 
variations of the residual signal. 
[0015] As embodied and broadly described herein, 
the invention also provides a method for processing a 
residual signal capable of exciting a linear prediction fil- 

io ter to generate a replica of an audio signal, said method 
comprising the step of attenuating an amplitude of the 
residual signal according to a transfer function estab- 
lishing a degree of amplitude attenuation that varies in 
accordance with an amplitude of the residual signal. 

75 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0016] 

20 Figure 1 is a block diagram of the encoder stage of 
a CELP vocoder; 

Figure 2 is a bloc diagram of the decoder stage of a 
CELP vocoder; 

25 

Figure 3a is a graph illustrating the transfer function 
a linear filter; 

Figure 3b is a graph illustrating the transfer function 
30 of a center-clipping filter; 

Figure 3c is a graph illustrating the transfer function 
of a non-linear filter; 

35 Figure 4a is a graph showing a probability distribu- 
tion function of the amplitude of a speech signal 
where the signal/noise ratio is high; 

Figure 4b is a graph showing a probability distribu- 
te tion function of the amplitude of a speech signal 
where the signal/noise ratio is low; 

Figure 5 is a block diagram of a non-linear filtering 
apparatus functioning in accordance with the princi- 
45 pies of the invention and the method detailed in Fig- 
ure 6; 

Figure 6 is a flowchart of the method for performing 
signal processing in accordance with the invention; 

50 

Figure 7a is a block diagram of a prior art CELP 
encoder/decoder; 

Figure 7b is a block diagram of a CELP encoder uti- 
55 lizing the non-linear filter in accordance with the 
invention; 

Figure 7c is a block diagram of a CELP decoder uti- 
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lizing the non-linear filter in accordance with the 
invention; 

Figure 7d is a block diagram of an audio signal 
encoding apparatus utilizing the non-linear filter in 
accordance with the invention where the filter is 
separate from the encoder structure; 

Figure 7e is a block diagram of an audio signal 
decoding apparatus utilizing the non-linear filter in 
accordance with the invention where the filter is 
separate from the decoder structure; 

Figure 8 is a block diagram showing the implemen- 
tation of Figure 7b in more detail; 

Figure 9 is a block diagram showing the implemen- 
tation of Figure 7c in more detail; 

Figure 10 is a block diagram showing the imple- 
mentation of Figure 7d in more detail; 

Figure 1 1 is a block diagram showing the imple- 
mentation of Figure 7e in more detail; 

DESCRIPTION OF A PREFERRED EMBODIMENT 

[001 7] In communications applications where channel 
bandwidth is at a premium, it is essential to use the 
smallest possible portion of a transmission channel. A 
common solution is to compress the voice signal with an 
apparatus called a speech codec before it is transmitted 
on a RF channel. 

[0018] Speech codecs, including an encoding and a 
decoding stage, are used to compress (and decom- 
press) the digital signals at the source and reception 
point, respectively, in order to optimize the use of trans- 
mission channels. Codecs used specifically for voice 
signals are dubbed "vocoders" (for voice coders). By 
encoding only the necessary characteristics of a 
speech signal, fewer bits need to be transmitted than 
what is required to reproduce the original waveform in a 
manner that will not significantly degrade the speech 
quality. With fewer bits required, lower bit rate transmis- 
sion can be achieved. 

[001 9] A prior art speech encoder/decoder combina- 
tion is depicted in Figure 7a. A PCM speech signal is 
input to a CELP encoder 700 that processes the signal 
provided and produces representation of the signal in a 
compressed form. The compressed form comprises a 
coefficient segment and an excitation segment. The 
coefficient segment includes LPC coefficients. Those 
coefficients uniquely defines a filter function that models 
the human vocal tract. The excitation segment is 
defined as information that needs to be combined with 
the coefficient segment in order to provide a complete 
representation of the audio signal. Such excitation seg- 
ment may include parametric information describing the 



periodicity of the speech signal, a residual as computed 
by the encoder of a vocoder, speech framing control 
information to ensure synchronous framing in the 
decoder associated with the remote vocoder, pitch peri- 
5 ods, pitch lags, gains and relative gains, among others. 
[0020] This information is then used to reproduce a 
PCM speech signal, along with the noise, by a CELP 
decoder 702. 

[0021] The residual signal can be defined as the part 
10 of the speech signal that the encoder of the vocoder 
was not able to predict. The residual signal is a highly 
unpredictable waveform of relatively small power. The 
signal power divided by the power of the prediction 
residual is called the prediction gain. A normal value for 
is the prediction gain is approximately 20 dB. The residual 
is therefore often described as being "spectrum flat- 
tened". 

[0022] Code Excited Linear Prediction (CELP) vocod- 
ers are the most common type of vocoder used in 

20 telephony presently. Instead of sending the excitation 
parameters, CELP vocoders send index information 
that points to a set of vectors in an adaptive and sto- 
chastic code book. That is, for each speech signal, the 
encoder searches through its code book for the one that 

25 gives the best perceptual match to the sound when 
used as an excitation to the LPC synthesis filter. 
[0023] Figure 1 is a block diagram of the encoder por- 
tion of a generic model for a CELP vocoder. As can be 
seen from this Figure, the only input is the PCM speech 

30 signal embedded with noise. This signal is input to the 
LPC analysis block 100 and to the adder 102. The LPC 
analysis block 100 outputs the LPC filter coefficients for 
transmission on the communication channel and as 
input to the LPC synthesis filter 105 and 110. At the 

35 adder 102, the output of the LPC synthesis filter 105 is 
subtracted from the PCM signal. The result is sent to a 
perceptually weighted filter 125 followed by an error 
minimization processor 127 that outputs the pitch index 
that will be transmitted on the communication channel. 

40 Those pitch indices are also sent back to the adaptive 
codebook 115 and to the first gain calculator 135 to 
effect a backward adaptation procedure, thus select the 
best waveform from the adaptive codebook to match the 
input speech signal. The first gain calculator 135 out- 

45 puts the first gain indices to be transmitted over the 
communication channel and to be input to the multiplier 
137. The adaptive codebook 115 outputs the periodic 
component of the residual to the multiplier 1 37 whose 
output is sent to the LPC synthesis filter 105. 

so [0024] At the adder 1 1 2, the output of the LPC synthe- 
sis filter 1 10 is subtracted from the output of the adder 
1 02. The result is sent to the perceptually weighted filter 
130 followed by an error minimization processor 132 
that outputs the code index that is transmitted over the 

55 communication channel and also fed back to the sto- 
chastic codebook 120 and to the second gain calculator 
1 40. The second gain calculator 1 40 outputs the second 
gain index that will be transmitted over the communica- 
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tion channel. The second gain index is used in the mul- 
tiplier 142 with the output to the stochastic codebook 
120, which is the statistic component of the residual sig- 
nal. 

[0025] Figure 2 is a block diagram of the decoder por- 5 
tion of a generic model for a CELP vocoder. The com- 
pressed speech frame is received from a 
telecommunication channel and fed to the different 
components of the decoder. The LPC coefficients are 
fed to an LPC synthesis filter 210. The pitch index is fed 10 
to the adaptive codebook 200 that calculates the peri- 
odic component of the residual with input from the last 
calculated residual. Its output is then multiplied with the 
first gain index by the multiplier 202. The code index is 
input to the stochastic codebook 205 that calculates the 15 
stochastic component of the residual and its output is 
multiplied with the second gain index by the multiplier 
207. These two parts of the residual are then added in 
the adder 204 and fed to the LPC synthesis filter 210. 
The LPC synthesis filter then uses the LPC filter coeffi- 20 
cients and the calculated residual to produce speech 
signal that goes through some post processing 215 
before it is output, usually in a PCM sample form. 
[0026] A segment exhibiting strong voicing is 
assumed to contain two additive components in the 25 
spectrum-flattened residual, a strong periodic compo- 
nent, due to the major pulses of the vocal tract excitation 
and an aperiodic noise component. This noise compo- 
nent represents the effects of spectrum-flattened envi- 
ronmental noise as well as minor secondary excitation 30 
pulses of the speech signal. The object of this invention 
is to achieve a relative suppression of the aperiodic 
component of the signal and thereby enhance the har- 
monic structure of the resynthesized speech. This result 
is obtained by nonlinear filtering the residual component 35 
of the compressed speech signal. 
[0027] Previous work in this area dealt with the center- 
clipping technique for pitch lag determination. This work 
is covered in the article entitled "New methods of pitch 
extraction" by M.M Sondhi. The contents of this article 40 
are incorporated herein by reference. Center-clipping a 
speech signal corrupted by noise attenuates the noise 
component. However, distortions may be introduced 
into the speech signal due to a disturbance in the short 
term correlation properties, or, viewed in the frequency 45 
domain, distortions in successive short term spectra 
may result. An example of a center-clipping filter is 
given at Figure 3b. 

[0028] Another center-clipping technique was used by 
Taniguchi et al. To modify the adaptive codebook in so 
CELP coding and thereby achieve pitch sharpening and 
is described in "Pitch sharpening for perceptually 
improved CELP and the sparse-delta codebook for 
reduced computation". This article is hereby incorpo- 
rated by reference. 55 
[0029] A nonlinear filter, is mathematically expressed 
by a nonlinear equation. In the present invention this fil- 
ter attenuates the amplitude of the residual signal sam- 



ples to a degree that varies with the amplitude of the 
input signal, namely the residual signal that presumably 
contains noise. In general, the lower the amplitude, the 
higher the attenuation. The transfer function of a non- 
linear filter found satisfactory for the present invention is 
given by the following equation: 

y(n)=A(n)x(n) 

where 

A(nymin(\x(n)/k\ t 1) 

and x(n) and y(n) are sampled values of the input and 
output signals, respectively, and k is a suitable thresh- 
old value. 

[0030] Another suitable form for a nonlinear filter 
equation would be: 

A(n)=min(x 2 (n)/k, 1 ) 

An example of the filter characteristics is given in Figure 
3c. The nonlinear filter equations above are example of 
the type of filter that can be used in this invention. Com- 
paratively, a linear filter is one that can be mathemati- 
cally expressed by a linear equation and an example of 
the characteristics of such a filter is shown in Figure 3a. 
[0031 ] The details of constructing a non-linear filter in 
accordance with the characteristics above will not be 
described in detail here since such filters are generally 
known to those skilled in the art. 
[0032] Notice that below an amplitude threshold k, the 
input is modified according to the nonlinear equation 
and that above the threshold, the output is simply equal 
to the input. The threshold k can be correlated to the 
standard deviation for each of the residual signal 
frames. For instance k may be the standard deviation 
over the residual signal frame multiplied by a constant. 
The threshold value k is meant to be variable such that 
when the amplitude of the speech is high relative to the 
noise amplitude, the standard deviation is high as well. 
This situation is depicted in Figure 4a. Conversely, 
when the speech content is low relative to noise, the 
standard deviation is low as well. This situation is 
depicted in Figure 4b. This implies that when the resid- 
ual signal samples have high amplitude characteristics, 
the threshold will be high and only the larger amplitude 
signal samples will be retained after filtering, thus 
increasing the periodicity of the signal. When the resid- 
ual signal samples have low amplitude characteristics, 
then the threshold will be low, thus only very small com- 
ponents of the signal samples, mainly noise, will be fil- 
tered and the result will again be increased periodicity, 
hence improved speech quality. 
[0033] A possible embodiment for a nonlinear filtering 
apparatus as described above is depicted in Figure 5. 
The nonlinear filtering apparatus 500 has a threshold 
calculator 510, a residual sample buffer 515, a nonlinear 
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fitter 520 and a filtered residual buffer 525. One input is 
provided to the nonlinear filtering apparatus 500. It is 
the residual samples 535. The output is the result of the 
nonlinear filtered residual samples 540 using a linear 
computation of the standard deviation of the residual 
samples over a frame as the amplitude threshold. 
[0034] The two buffers (51 5 and 525) are simply tem- 
porary storage elements that keep the required informa- 
tion for a period equal to a speech frame. The threshold 
calculator 510 takes its information from the residual 
sample buffer and calculates the standard deviation for 
one PCM sample of the residual signal. It then calcu- 
lates the value /c, such as by multiplying the standard 
deviation value by a suitable constant. The threshold 
calculator 510 sends this information to the nonlinear fil- 
ter 520 that uses it as its threshold value. 
[0035] The flowchart of Figure 6 describes the method 
that implements a nonlinear filtering apparatus. At step 
600, the apparatus gets a 20 millisecond frame of 
speech signal embedded with noise in the PCM format. 
A residual is generated for each frame (step 605) and 
input to the buffer 515. The amplitude threshold for that 
sample is then calculated (step 610). The filter threshold 
is adjusted accordingly (step 615). The residual is input 
to the nonlinear filter (step 620) and the resulting output 
is a new residual (step 625). At step 630, the apparatus 
verifies if this is the last frame. If it is, the apparatus 
returns to step 600 to get the next 20 millisecond sam- 
ple. If it is not, the procedure is stopped. 
[0036] Four examples of locations in which the nonlin- 
ear filtering apparatus 500 may be introduced are given 
in Figures 7b to 7e. The nonlinear filter apparatus can 
be either implemented on the encoder side (as in Fig- 
ures 7b and 7d) or the decoder side (as in Figures 7c 
and 7e). 

[0037] Figure 7b depicts a proposed implementation 
of the nonlinear filtering apparatus 500 on the encoder 
side 704 when access to it is provided. Figure 7c 
depicts a proposed implementation of the nonlinear fil- 
tering apparatus on the decoder side 708 when access 
to it is provided. Figure 7d depicts a proposed imple- 
mentation when the nonlinear filtering apparatus 500 is 
placed before the encoder 712 when access to it is not 
provided. Figure 7e depicts a proposed implementation 
of the nonlinear filtering apparatus 500 after the 
decoder 718 when access to it is not provided. 
[0038] Figures 8 through 1 1 give a more detailed view 
of the possible implementation for the nonlinear filtering 
apparatus 500 and their descriptions are provided 
below. 

[0039] The most preferred embodiment is shown in 
Figure 8. If access is provided to modify the encoder, 
the nonlinear filtering apparatus 500 may be inserted 
along with a LPC inverse filter 800, that receives the 
LPC coefficients from the LPC analysis block 100 and 
outputs a residual signal, and a LPC synthesis filter 850 
as input to the adder 1 02. The output of the nonlinear fil- 
tering apparatus 500 is a modified residual that is input 



10 

to the LPC synthesis filter 850. The rest of the vocoder 
remains the same. The particular reason for which it is 
preferred is because it suppresses both coding and 
environmental noise without introducing signal delays. 

5 [0040] As shown in Figure 9, if access to the encoder 
712 is not provided, the nonlinear filtering apparatus 
500 can be used to provide a modified signal as the ref- 
erence to be matched. In this case a PCM speech sig- 
nal and its noise are input to a LPC analysis block 900 

10 that produces the LPC coefficient to input to the LPC 
inverse filter 905 that in turn produces a residual. The 
residual is nonlinear filtered (apparatus 500) and 
passed through a LPC synthesis filter (910) which pro- 
vides the new reference signal that is input to the LPC 

15 analysis block 100 and the adder 102. The additional 
processing required in this case will result in a signal 
delay. 

[0041] The implementations are also different if 
access is provided to the decoder or not. If it is, the non- 
20 linear filtering apparatus 500 is inserted immediately 
before the LPC synthesis filter 210 of the decoder 710 
as shown in Figure 10. 

[0042] When access to the decoder 71 8 is not availa- 
ble, the implementation is such as represented at Fig- 

25 ure 11. The decoder 718 produces a reconstructed 
signal along with its noise output. This signal is input to 
a LPC analysis processor 1100 which provides coeffi- 
cients to an LPC inverse filter 1 1 05 and a LPC synthesis 
filter 1110. The PCM signal is then passed through the 

30 LPC inverse filter 1 105 and a residual is produced. This 
residual is nonlinear filtered (apparatus 500) and then 
passed through an LPC synthesis filter 1110. The LPC 
synthesis filter 1110 reconstructs the speech signal with 
a filtered noise output. 

35 [0043] In other applications where digital speech 
transmission is not involved, the nonlinear filtering 
apparatus 500 can be used as a generalized noise sup- 
pressor. The embodiment would then be the same as in 
Figure 11. That is, the input a PCM speech signal 

40 embedded with noise and the output is a reconstructed 
signal with nonlinear filtered noise. The setup would 
involve a LPC analysis processor 1100, and a LPC 
inverse filter 1 105, a LPC synthesis filter 1110 and the 
nonlinear filtering apparatus 500. This embodiment also 

45 allows use of the noise suppressor as a pre-filter to 
other coding systems, reducing the environmental noise 
that has become mixed with the received speech signal. 
[0044] The above description of a preferred embodi- 
ment should not be interpreted in any limiting manner 

50 since variations and refinements can be made without 
departing from the spirit of the invention. The scope of 
the invention is defined in the appended claims and 
their equivalents. 

55 Claims 

1. A non-linear filter comprising a residual signal 
processing means for generating a residual signal 
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capable of exerting a linear prediction filter to gener- 
ate a replica of an audio signal, said means com- 
prising: 

means for attenuating an amplitude of the s 
residual signal according to a transfer function 
which establishes a degree of amplitude atten- 
uation that varies in accordance with an ampli- 
tude of the residual signal. 

10 

2. A non-linear filter as defined in claim 1, wherein, 
said residual signal processing means causes 
attenuation of samples of the residual signal having 
an amplitude not exceeding a certain threshold k. 

15 

3. A non-linear filter as defined in claim 2, wherein 
said transfer function is linear for samples having 
an amplitude exceeding said threshold k. 

4. A non-linear filter as defined in claim 2 or 3, wherein 20 
k is variable for each frame. 

5. A non-linear filter as defined in claim 4, wherein 
said residual signal processing means includes 
means for periodically re-computing a value for k. 25 

6. A non-linear filter as defined in claim 5, wherein 
said means for periodically re-computing a value for 
k includes means for computing a standard devia- 
tion of a plurality of samples of the residual signal. 30 

7. A non-linear filter as defined in claim 6, wherein the 
plurality of samples of the residual signal define a 
frame of the signal. 

35 

8. A non-linear filter as defined in claim 6 or 7, wherein 
said means for computing a standard deviation, 
effects a computation of a standard deviation over a 
frame of the residual signal. 

40 

9. A non-linear filter as defined in any preceding claim, 
wherein said transfer function is defined by: 

y(n)=A(n)x(n) 

45 

where 

A(n)-min(\x(n)A\ t 1) 

and x(n) and y(n) are sampled values of the input so 
and output signals, respectively, and k is the ampli- 
tude threshold value. 

10. An audio signal processing apparatus including 
means for generating a residual signal capable of 55 
exciting a linear prediction filter to generate a rep- 
lica of an audio signal, said means comprising a 
non-linear filter that includes: 



an input for receiving the residual signal; a 
residual signal processing means coupled to 
said input for receiving the residual signal, said 
residual signal processing means having a 
transfer function that causes an attenuation of 
the residual signal, said transfer function estab- 
lishing a degree of amplitude attenuation that 
varies in accordance with an amplitude of the 
residual signal; and 

an output coupled to said residual signal 
processing means for outputting the residual 
signal altered by said residual signal process- 
ing means. 

11. The audio signal processing apparatus as defined 
in claim 10, wherein said audio processing appara- 
tus is a voice encoder or a voice decoder. 

12. The audio signal processing apparatus as defined 
in claim 1 1 wherein said encoder or decoder is of a 
CELP type. 

13. The audio signal processing apparatus as defined 
in any one of claims 10, 11 or 12, wherein said 
audio processing apparatus includes a synthesis fil- 
ter coupled to said output. 

14. The audio signal processing apparatus as defined 
in claim 13, wherein said synthesis filter is a linear 
prediction filter. 

15. A method for processing a residual signal capable 
of exciting a linear prediction filter to generate a 
replica of an audio signal, said method comprising 
the step of attenuating an amplitude of the residual 
signal according to a transfer function establishing 
a degree of amplitude attenuation that varies in 
accordance with an amplitude of the residual sig- 
nal. 

16. A method as defined in claim 15, comprising the 
step of causing attenuation of samples of the resid- 
ual signal having an amplitude not exceeding a cer- 
tain threshold k. 

17. The method as defined in claim 15 or 16, wherein 
said transfer function is linear for samples having 
an amplitude exceeding said threshold k. 

18. The method as defined in claim 16 or 17, wherein k 
is variable. 

19. The method as defined in claim 18, comprising the 
step of periodically re-computing a value for k. 

20. The method as defined in claim 19, comprising the 
step of computing a standard deviation over a plu- 
rality of samples of the residual signal to compute a 
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value for k. 

21. The method as defined in claim 20, wherein the plu- 
rality of samples of the residual signal define a 
frame of the signal. s 

22. The method as defined in claim 20, wherein said 
step of computing a standard deviation over a plu- 
rality of samples of the residual signal to compute a 
value for k includes the procedure of effecting a io 
computation of a standard deviation over a frame of 

the residual signal. 

23. The method as defined in any one of claims 16 to 

22, wherein said transfer function is defined by: 15 

y(n)=A(n)x(n) 

where 

20 

A(n)=min(\x(n)/k\,1) 

and x(n) and y(n) are sampled values of the input 
and output signals, respectively, and k is the ampli- 
tude threshold value. 25 
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speech quality. The invention also extends to a novel 
method for processing a residual signal used to excite a 
linear prediction synthesis filter in order to attenuate 
wide band additive noise in the speech signal as con- 
structed by the synthesis filter. 



500 



510 



520 
Nonlinear flftar 



CO 

< 

CO 
v- 

CD 

o> 

CO 

o 

Q_ 

LU 



sample output 



Primed by Xerox (UK) Business Services 
2.16.7/3.6 



BNSDOCID: <EP_ 



EP 0 899 718 A3 



J 



European Patent 
Office 



EUROPEAN SEARCH REPORT 



Application Number 

EP 98 20 2812 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document with indication, where appropriate, 
of relevant passages 



Relevant 
to claim 



CLASSIFICATION OF THE 
APPLICATION (lnt.Cl.6) 



D,A 



P,X 



WO 97 00516 A (NOKIA MOBILE PHONES LTD 
; NOKIA MOBILE PHONES UK LTD (GB); 
JARVINEN) 3 January 1997 (1997-01-03) 

* figure 3 * 

* page 4 * 

* page 12 - page 13 * 

US 5 133 013 A (MUNDAY EDWARD) 
21 July 1992 (1992-07-21) 

* abstract; figure 4 * 

MAN MOHAN S0NDHI: "New Methods of Pitch 
Prediction" 

IEEE TRANSACTIONS ON AUDIO AND 
ELECTR0AC0USTICS, 

vol. AU-16, no. 2, June 1968 (1968-06), 

pages 262-266, XP002112239 

IEEE 

* page 264 - page 265 * 

MERMELSTEIN P ET AL: "Nonlinear filtering 
of the LPC residual for noise suppression 
and speech quality enhancement" 
IEEE WORKSHOP ON SPEECH CODING FOR 
TELECOMMUNICATIONS. BACK TO BASICS: 
ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH 
CODING, 7-10 September 1997, pages 
49-50, XP002112240 
IEEE, New York, NY, USA, ISBN: 
0-7803-4073-6 

* the whole document * 



1,10,15 



G10L3/02 



1,10,15 



1,10,15 



1-23 



TECHNICAL FIELDS 
SEARCHED (lnt.CI.6) 



G10L 



The present search report has been drawn up for all claims 



Place ol search 

THE HAGUE 



Date ot completion of the search 

13 August 1999 



Examiner 

Ramos Sanchez, U 



CATEGORY OF CITED DOCUMENTS 

X : particularly relevant if taken alone 

Y : particularly relevant if combined with another 

document of the same category 
A : technological background 
O : non-written disclosure 
P : intermediate document 



T : theory or principle underlying the invention 
E : earlier patent document, but published on, or 

after the filing date 
D : document cited in the application 
L : document cited for other reasons 

& : member of the same patent family, corresponding 
document 



2 



BNSDOCID: <EP 089971 8A3_I_> 



EP0 899 718 A3 



ANNEX TO THE EUROPEAN SEARCH REPORT 
ON EUROPEAN PATENT APPLICATION NO. 



EP 98 20 2812 



This annex lists the patent family members relating to the patent documents cited in the above-mentioned European search report. 
The members are as contained in the European Patent Office EDP file on 

The European Patent Office is in noway liable for these particulars which are merely given for the purpose of information. 

13-08-1999 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



WO 9700516 



03-01-1997 



US 5133013 



21-07-1992 



AU 


6230996 


A 


15- 


01- 


•1997 


CA 


2224688 


A 


03- 


-01- 


1997 


CN 


1192817 


A 


09- 


■09- 


1998 


EP 


0832482 


A 


01- 


-04- 


■1998 


AT 


101767 


T 


15- 


-03- 


1994 


CA 


1332626 


A 


18- 


-10- 


•1994 


DE 


68913139 


D 


24- 


-03- 


1994 


DE 


68913139 


T 


23- 


-06- 


1994 


EP 


0367803 


A 


16- 


-05- 


•1990 


WO 


8906877 


A 


27- 


-07- 


•1989 


GB 


2220330 


A,B 


04- 


-01- 


•1990 


HK 


121496 


A 


19- 


-07- 


1996 


JP 


2503256 


T 


04- 


-10- 


•1990 



2 



uj For more details about this annex : see Official Journal of the European Patent Office, No. 12/82 



BNSDOCID: <EP_ 



_089971BA3_I_> 



